Fitting Ranked Linguistic Data with Two-Parameter Functions
Authors
Abstract
It is well known that many ranked linguistic data sets are fit well by one-parameter models, such as Zipf’s law for ranked word frequencies. However, where the data deviate from the one-parameter model (these deviations occur at the two extremes of the rank), it is natural to add one more parameter to the fitting model. In this paper, we compare several two-parameter models, including the Beta function, the Yule function, and the Weibull function, all of which can be framed as a multiple regression on the logarithmic scale, in how well they fit several kinds of ranked linguistic data, such as letter frequencies, word-spacings, and word frequencies. We observed that the Beta function fits the ranked letter frequencies best, the Yule function fits the ranked word-spacing distribution best, and the Altmann, Beta, and Yule functions all slightly outperform Zipf’s power-law function for the ranked word-frequency distribution.
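The idea of fitting a rank function as a multiple regression on the logarithmic scale can be sketched as follows. This is a minimal illustration, not the paper's implementation: the abstract does not state the functional forms, so the sketch assumes the commonly cited ones, namely Zipf f(r) = C / r^a, Beta f(r) = C (N + 1 - r)^b / r^a, and Yule f(r) = C b^r / r^a, where r is the rank and N the number of ranked items. The helper name `fit_ranked` and the synthetic data are hypothetical.

```python
import numpy as np

def fit_ranked(freqs, model="zipf"):
    """Least-squares fit of a rank function in log scale.

    Assumed forms (not given in the abstract):
      zipf: log f = c0 - a*log(r)
      beta: log f = c0 - a*log(r) + b*log(N + 1 - r)
      yule: log f = c0 - a*log(r) + r*log(b)
    Returns the regression coefficients and the sum of squared
    residuals of log f.
    """
    f = np.sort(np.asarray(freqs, dtype=float))[::-1]  # rank by decreasing value
    n = f.size
    r = np.arange(1, n + 1, dtype=float)
    cols = [np.ones(n), np.log(r)]
    if model == "beta":
        cols.append(np.log(n + 1 - r))
    elif model == "yule":
        cols.append(r)
    elif model != "zipf":
        raise ValueError(f"unknown model: {model}")
    X = np.column_stack(cols)
    y = np.log(f)
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ coef
    return coef, float(resid @ resid)

if __name__ == "__main__":
    # Synthetic, noise-free data that follows the assumed Beta form exactly.
    ranks = np.arange(1, 27)
    data = 100.0 * (27 - ranks) ** 0.5 / ranks
    for name in ("zipf", "beta", "yule"):
        coef, sse = fit_ranked(data, name)
        print(name, coef, sse)
```

Because the Zipf regression is nested inside the Beta and Yule regressions, the two-parameter fits can never have a larger residual sum of squares on the same data; a fair comparison between models would therefore also account for the extra parameter.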
Similar resources
Classical and Bayesian Inference in Two Parameter Exponential Distribution with Randomly Censored Data
Abstract. This paper deals with the classical and Bayesian estimation for two parameter exponential distribution having scale and location parameters with randomly censored data. The censoring time is also assumed to follow a two parameter exponential distribution with different scale but same location parameter. The main stress is on the location parameter in this paper. This parameter has not...
Characterizing Ranked Chinese Syllable-to-Character Mapping Spectrum: A Bridge Between the Spoken and Written Chinese Language
One important aspect of the relationship between spoken and written Chinese is the ranked syllable-to-character mapping spectrum, which is the ranked list of syllables by the number of characters that map to the syllable. Previously, this spectrum was analyzed for more than 400 syllables without distinguishing the four intonations. In the current study, the spectrum with 1280 toned syllables is a...
An Application of Discounted Residual Income for Capital Assets Pricing by Method Curve Fitting with Sinusoidal Functions
The basic model for valuation of firm is the Dividend Discount Model (DDM). When investors buy stocks, they expect to receive two types of cash flow: dividend in the period during which the stock is owned, and the expected sales price at the end of the period. In the extreme example, the investor keeps the stock until the company is liquidated; in such a case, the liquidating dividend becomes t...
The early plasma concentration of 51Cr-EDTA in patients with cirrhosis and ascites: a comparison of three models
OBJECTIVES The aim of the study was to determine which of three two-parameter fitting functions (exponential, linear-log, and negative-power function of time) most accurately models early chromium-51-EDTA (51Cr-EDTA) plasma concentration data prior to 120 min in patients with cirrhosis and ascites and understand how these fitting functions affect the calculation of the area under the plasma con...
Three-parameter Kappa distribution and its fitting to the whole monthly rainfall data of Abali station in Tehran province
Kappa distribution is a positively skewed distribution which is used in analyzing precipitation, wind speed and streamflow data. In this paper, first a three-parameter Kappa distribution that was introduced by Park et al. (2009) is studied and then four different methods of estimation including Moments, L-Moments, Maximum Likelihood and Maximum Product Spacing Method are presented in order to estim...
Journal title:
- Entropy
Volume 12, Issue
Pages -
Publication date 2010